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Abstract 

We consider the size and structure of the automorphism groups of a variety of empirical 'real-world' 
networks and find that, in contrast to classical random graph models, many real-world networks are 
richly symmetric. We relate automorphism group structure to network topology and discuss generic 
forms of symmetry and their origin in real-world networks. 
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1 Introduction 



The use of complex networks to model the underlying topology of 'real-world' complex systems - from social 
interaction networks such as scientific collaboration networks [THl fT^ to biological regulatory networks [H] 
and technological networks such as the internet [22] - has attracted much current research interest [I| [20l [24] . 
Previous studies have highlighted the fact that seemingly disparate networks often have certain features in 
common including (amongst others): the 'small- world' property [5S]; the power-law distribution of vertex 
degrees [2; and network construction from motifs[T7]. 

Identification of universal structural properties such as these allows generic network properties to be 
decoupled from system-specific features. In this present work we consider the symmetry structure of a 
variety of real-world networks and find that a certain degree of symmetry is also ubiquitous in complex 
systems. 

We consider network symmetry via the automorphism group of the underlying graph. Firstly, we identify 
'essential' network symmetries and use these symmetries to derive a natural direct product decomposition of 
the automorphism group into irreducible factors. This decomposition is per se a very efficient way to handle 
large automorphism groups of real- world networks. We associate with each factor in this decomposition a 
symmetric subgraph - the subgraph on which the factor subgroup acts non-trivially - and investigate the 
generic structure of symmetric subgraphs. 



2 Network Automorphism Groups 

Mathematically, a network is a graph, = ';^{V,E), with vertex set, V (of size Ncg), and edge set, E (of 
size Mc^) where vertices are said to be adjacent if there is an edge between them. An automorphism is a 
permutation of the vertices of the network which preserves adjacency. The set of automorphisms under 
composition forms a group, Aut(^), of size a^fS]. Throughout this discussion we shall let ^ refer to a 
generic network, and G to a generic group. If the network is a multi-digraph, we remove weights and 
directions and consider the automorphism group of the underlying graph. 

Here, the nauty programfTBI - which includes one of the most efficient graph isomorphism algorithms 
available[9] - is used to calculate the size and structure of the various automorphism groups. 

Table H] gives the order of the automorphism group of some real- world complex networks, all of which are 
nontrivial. Since almost all large graphs (including, for example, the classical Erdos-Renyi random graphs) 
are asymmetric ^ this symmetry is somewhat unexpected and begs an explanation. 

Many networks ~ for example the internet and the world wide web - are 'growing' [2] (that is, new vertices 
are added to the network over time). Generically, any growth process in which allows for new vertices to 
be added to the network one at a time naturally leads to a network with locally tree-like regions. Such 
locally tree-like areas are common in real-world networks and their presence is important because, while 
the majority of large graphs are asymmetric, it is common for large random trees to exhibit a high degree 
of symmetry [13], deriving from the presence of identical branches about the same fork. Thus we expect a 
certain degree of tree-like symmetry to be present in many real-world networks. In the following sections 
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Network 








Human B Cell Genetic Interactions 3^ 


5,930 


64,645 


5.9374 X 10" 


C. elegans Genetic Interactions [26j 


2,060 


18,000 


6.9985 X lO^^i 


BioGRID datasets[5B]: 








Human 


7,013 


20,587 


1.2607 X 10^*^5 


S. cerevisiae 


5,295 


50, 723 


6.8622 X IQ^^ 


Drosophila 


7, 371 


25,043 


3.0687 X 10^93 


Mus musculus 


209 


393 


5.3481 X 10^25 


Internet (Autonomous Systems Level) [12] 


22,332 


45,392 


1.2822 X 10"'298 


US Power Grid|25j 


4,941 


6,594 


5.1851 X 10^52 



Table 1: The size of the automorphism group of some real- world networks. The size of the automorphism group 
of the largest connected component is given (to 5 significant figures) which, in all cases, contains at least 93% of 
the vertices in the network. Connected components were extracted using Pajek^]- 

we determine the extent to which real-world symmetry is locally tree-like. We begin by considering the 
structure of network automorphism groups. 



3 Factorization of Automorphism Groups 

In this section we describe a computationally efficient factorization of large network automorphism groups 
- which are often too large to allow direct analysis (see Tabled]) - into 'irreducible building blocks'. 

Consider the permutations of a set of n points X ~ {xi, . . . ,Xn}- The support of a permutation p is 
the set of points which p moves, supp(p) = {xi \ p{xi) ^ Xi}. Two permutations p and q are disjoint 
if their supports are non-intersecting. If p and q are disjoint then they commute (with respect to the 
composition of permutations). Similarly, two sets of permutations P and Q are support- disjoint if every 
pair of permutations p £ P and q € Q have disjoint supports. 

Let '^^ he a network with automorphism group Aut(^^). Let S" be a set of generators of Aut($^). Suppose 
that we partition S into n support-disjoint subsets = U . . . U 5^ such that each Si cannot itself be 
decomposed into smaller support-disjoint subsets. Call Hi the subgroup generated by Si. Since 5 is a 
generating set and elements from different factors Hi, Hj commute, this procedure gives a direct product 
decomposition: 

Aut(^) = i7i X i/2 X .. . X i/„. (1) 

Note that, in general, the choice of generators of a group is not unique and different choices of generating 
sets may give different decompositions. Thus, for this decomposition to be well-defined, we need to show 
that it is unique and the factors in Eq. ([1]) are 'irreducible'; that is, they cannot be written as K x L with 
K and L support-disjoint subgroups. 

A group G is support-indecomposable if it cannot be written as K x L with K and L support-disjoint 
subgroups. Similarly, a set S is support-indecomposable if it cannot be written as 5*1 U 52 with 5i, 5*2 ^ 
both support-disjoint subsets. 
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Proposition 3.1. The subgroups in Eq.mi are independent of the choice of generators (that is, unique) 
and support-indecomposable (that is 'irreducible') when the generating set S satisfies the following two 
conditions 

(*) S does not contain elements in the form s — gh with g,h ^ 1 and g, h support- disjoint; 

if a subset S' <Z S generates a subgroup H < G such that H — Hi x H2 with Hi and H2 support- 
disjoint then there exits a partition S" = 5i U ^2 such that Si generates Hi . 

Note that these conditions are ensured if, for example, the nauty algorithm is used to calculate the 
generators of Aut(J#) (see parts (1) and (2) of Theorem 2.34 in [TB]). The proof of proposition 13.11 can be 
considered in two parts: irreducibility and uniqueness. 

Proposition 3.2. (irreducibility) Let S be a finite set of permutations and H the group generated by S. If 
H is support-indecomposable as a group, then so is S as a set. The converse is also true when S satisfies 

Proof. The first claim is clear. For the converse, suppose that S = {si, . . . , s„} is support-indecomposable 
as a set but H = K x L {K, L support-disjoint). Then si = kl ioi k Q K , I ^ L. By condition {*), k — 1 
or I = 1, that is, si €1 K or si G L, and similarly for S2, . . . , Sn- Thus S ~ {S H K) U (S* fl L). Since S is 
support-indecomposable as a set, S d K or S r\L \s empty, that is, S" C X or C L. Hence H = K and 
L = 1 or H = L a.nd K ^ 1. □ 

Proposition 3.3. (uniqueness) Suppose that X and Y are two sets of generators of a permutation group 
G, with associated direct product decompositions 

G = Hi X ... X Hn, 
G ^ Ki X . . . X Km ■ 

If both X and Y are essential, then n — m and there is a permutation a of the factors such that Hi = -K^o-(i) 
for i — 1, . . . , n. 

Proof, (sketch) Firstly, generalize condition to a finite number of subgroups Hi, . . . , H„, by induction 
on n. Then apply this to the first set of generators X with respect to the second decomposition. We 
then have a partition X = Xi \J . . . U X„i such that Xi generates Ki (1 < i < m). Suppose that Hi is 
generated by a set {xi, . . . ,xt} C X . Since Hi and Xi, . . . , X^ are support-indecomposable, we must 
have {xi, . . . ,xt} C Xi-^ for some ii. That is, Hi C Ki-^. Since Ki-^ is support-indecomposable this implies 
Hi = Ki-^. The same argument applies for H2, . . . , Hn. □ 

Thus, the decomposition given in Eq. ([T]) is well-defined if (for example) the nauty algorithm is used. We 
shall refer to this decomposition as the geometric decomposition, and note that it is a simple variation 
of the KruU-Schmidt factorization into the direct product of indecomposable subgroups FSTl. A GAP [10] 
procedure which calculates the geometric decomposition for an arbitrary permutation group is available 
from the authors on request. 

In general the geometric decomposition is coarser than the KruU-Schmidt decomposition since non-disjoint 
permutations may still commute. The KruU-Schmidt decomposition may easily be obtained from the ge- 
ometric decomposition using a computational group theory package such as GAP. The main advantage to 
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Network Aut(^#) % Basic Factors 

Human B Cell Genetic Interactions x S"! x 5*4 97.4 

C. elegans Genetic Interactions X Sf xSfx Si X Si X X Ss X S^ x 98.7 

X Sii X S33 X (C2 I C2) 

BioGRID datasets: 

Human Cf ^ x Sl° x Sf x S^"^ x Sl° x Sj x Si x Sl x 99.5 

Sw X Sii X SI2 X 5i5 X Sfg X Si7 X S23 X S26 X ^44 

5". cerevisiae x 5f x S*! x S^ x 5| x Sj x Su x Sn 100 

Drosophila Cf^ x Sf x Sf x Sl^ x ^g" x S]° x 5| x S*! x 99.2 

-5*10 X S"?! X SI2 X SiA X 5i6 X S'20 X S'30 

Mus musculus C'l x S^ x Si x S5 x S^ x Ss x Sio x Sn x 100 

5*12 X S'26 X 5*44 

Internet (Autonomous Systems Level) Cf ^ x Sf^ x Sj^'' x S^" x 5f x S^^ x S-f x 98.4 



C47 y C32 y 024 y Cl4 „ y Cl3 y 08 y ^9 

'-'9 ^ '-'10 ^ "^ll ^ '-'12 ^ '^13 ^ '-'14 ^ "^15 ^ "^ll 
^[7 X Sf^ X Sll X Slo X X 522 X ^23 X Sg 



X 



X 



'S'33 X 514 X 535 X X 537 X X ^41 X ^42 X 
543 X 546 X 54g X 550 X 551 X 552 X 554 X 556 X 
558 X 559 X 560 X 562 X 564 X 570 X 57i X 576 X 
579 X 582 X 595 X 5ii2 X 5i37 X 5i38 X 5i47 X 
5i67 X 5i70 X 5i94 X 5202 X 5216 X 5276 X 5318 X 
5356 X (C2 ; C2)2 

US Power Grid Cf^ x 5f x Sl'^ x 5| x 5| x 57 x 59 x (C2 ? C2 ) ^ 



Table 2: The geometric decomposition of the automorphism group of some real- world networks, and percentage 
basic factors (see section H}. In all cases, the automorphism group can be decomposed into direct and wreath 
products of symmetric groups. 

using the geometric decomposition is that it provides a computationally efficient way to calculate the struc- 
ture of large real-world networks and relates more intuitively to graph topology than the KruU-Schmidt 
factorization. For all the real-world networks we considered the automorphism group was factorized effi- 
ciently using this method, while a direct 'brute-force' factorization was not computationally feasible. 

The geometric decompositions of some real- world networks are given in Table [H In all cases the geometric 
factors are either symmetric groups or wreath products of symmetric groups (wreath products are a mild 
generalization of direct products, see [21] for a definition and examples). 

Remark: it is a result of Polya that automorphism groups of trees belong to the class of permutation 
groups which contains the symmetric groups and is closed under taking direct and wreath products [S]. 
Thus, the automorphism groups of many real-world networks belong to the same class of groups as the 
automorphism groups of trees. Note however, this does not necessarily mean that real-world symmetry is 
tree-like (for example, the complete graphs also belong to this class). In the following section we relate 
automorphism group structure to network topology in order to determine the extent to which real-world 
symmetry is, in fact, tree-like. 
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4 Automorphism Group Structure and Symmetric Subgraphs 



The induced subgraph on a set of vertices S' G 5^ is the graph obtained by taking S and any edges whose 
end points are both in S. We define a symmetric subgraph as the induced subgraph on the support of a 
geometric factor H (that is, on the points with non-trivial action by H). It is natural to ask whether there 
are any properties of symmetric subgraphs which are generic. 

From table [2] it is clear that most of the geometric factors found in real- world networks are isomorphic to 
Sn (for some n). Furthermore, almost all of these symmetric factors act transitively on their supports. We 
shall refer to transitive symmetric factors as basic factors and associated symmetric subgraphs as basic 
symmetric subgraphs (BSS's). We shall refer to all other factors as complex factors and their associated 
symmetric subgraphs as complex symmetric subgraphs. Table [2] shows that, in the cases we considered, 
almost all factors are basic and therefore that almost all symmetry is due to the presence of basic symmetric 
subgraphs. 

Since a graph 5^ on n vertices with Aut(^) = Sn is either empty or complete [H] it is immediate that 
BSS's are also either empty or complete. Furthermore, transitivity ensures that for a given BSS ^ and a 
given vertex v G ^ — all vertices in ^ are adjacent to v or none are. This means that most real-world 
symmetry is due to the presence of symmetric cliques (complete subgraphs invariant under Aut(^^)) and 
symmetric bicliques (complete bipartite subgraphs invariant under Aut(^^)). 

In practice, for all the real-world networks we considered, bicliques other than stars (a fc-star is a subgraph 
consisting of a vertex of degree k+ \ adjacent to k vertices of degree 1), although occasionally present, were 
rare (see Fig. [5] for some examples). In fact, we found that stars were the predominant symmetry structure 
present in all the networks we considered, although symmetric cliques were also significantly present in a 
number of networks. For example, the c. elegans genetic regulatory network[lB] - which was constructed 
by inferring connections from multiple datasets across multiple organisms and is thus arguably one of the 
most well-characterized biological networks available - contains multiple symmetric cliques, including one 
on 33 vertices corresponding to the largest geometric subgroup in the decomposition of its automorphism 
group. This example (and those in Fig. [2]) illustrate the fact that although much real-world symmetry 
is tree-like (and thus can be related to generic growth processes) a certain degree is not. In particular, a 
significant proportion of real-world symmetry originates in symmetric cliques. Since cliques and bicliques 
are topologically very similar (they are both complete multipartite graphs), the presence of symmetric 
cliques in complex networks may derive from similar growth processes to those that produce stars in 
combination with local clustering. 

Fig. [1] gives a typical arrangement of symmetric subgraphs (basic and complex) found in many real world 
networks, illustrating the relationship between these symmetric subgraphs and the structure of the network 
automorphism group. Since complex symmetric subgraphs can potentially take any form it is not possible 
to say anything general about their structure. However, since they are rare they may be considered on 
a case-by-case basis. Fig. [2] shows the complex symmetric subgraphs present in the US power grid, 
illustrating that in some real- world networks a certain degree of complex symmetry is present. 
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Figure 1: A typical arrangement of symmetric subgraphs. The geometric decomposition of the automor- 
phism group of this graph is Aut({#) = Ci x S3 x S4 x (C2 J C2). This example illustrates how different symmetric 
subgraphs contribute to the automorphism group, as well as showing common 'non-treelike' real-world symmetry. 
In particular note the 4-star (red) and the 3-clique (green) which correspond to the factors S'4 and S3 respectively in 
the geometric decomposition of Aut(5f). We found that wreath product factors generally associate with extended 
branches (see the far right of this figure), although this is not always the case (see the starred subgraph in Fig. [2] 
for example). Vertices are colored by orbit, fixed points are in white. 



5 Conclusions 



We have considered the automorphism groups of a variety of real-world networks and have found that a 
certain degree of symmetry is ubiquitous. We have constructed a practical decomposition of the automor- 
phism groups of these networks (the geometric decomposition), and found that the automorphism groups 
can typically be decomposed into direct and wreath products of symmetric groups. We have shown that 
each geometric factor can be associated with a symmetric subgraph, and demonstrated that most factors 
can be related to either a symmetric clique or symmetric biclique. Thus, we find that these two types of 
subgraph generically account for almost all real-world network symmetry. 

We anticipate that an important manifestation of real-world symmetry may be its effect on network 
behavior. For example, many biological regulatory networks are not simply static but rather can be 
associated with an underlying dynamical system: 

^ = MA.,x,), (2) 

where Xi is the state of the ith species (for example gene, protein etc.) and Aij is the network adjacency 
matrix. In many cases, while network topology may be known (from reverse engineering of microarray 
data, for instance) the specific form of the coupling functions fi are unknown or experimentally unverified 
(or unverifiable) . In these circumstances it is useful to know if there are any model independent properties 
of the dynamical system (that is properties which derive from network structure and are exhibited by a 
variety of coupling functions). It is here that symmetry is useful since, for equivariant dynamical systems 
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Figure 2: Complex symmetric subgraphs in the US power grid. Vertices in white correspond to those in 
the symmetric subgraphs. Vertices in black are those adjacent to those in the symmetric subgraph, and are shown 
to clarify subgraph structure. The starred subgraph has automorphism group C2IC2, illustrating that wreath 
products do not associate exclusively with extended branches such as in the example in Fig. [1] 

(those which remain 'unchanged' under the action of a given symmetry group), dynamic behavior can 
often be related in a generic way to symmetry structure. More precisely 'the symmetries of a system 
imply a 'catalogue' of typical forms of behavior from which the actual behavior is ' selected" Since the 
symmetry group of Eqs. ^ is necessarily a subgroup of the underlying network automorphism group, 
knowledge of the network automorphism group is a first step toward classifying typical forms of behavior 
that a given network may exhibit. This has been achieved for certain simple systems including those with 
Sn symmetryjSl 111) but has yet to be extensively investigated for more complex systems such as the ones 
we consider here. In light of these advances, and the fact that automorphism groups of real-world networks 
commonly have a rather simple generic form - they can be decomposed into direct and wreath products of 
symmetric groups - we anticipate that determination of a catalogue of forms of behavior for some complex 
networks may be feasible. 
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